Models


openai

GPT 4o mini

(gpt-4o-mini)

Cheapest
GPT 4o miniBy OpenAI
GPT-4o mini is OpenAI's newest model after GPT-4 Omni, supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable than other recent frontier models, and more than 60% cheaper than GPT-3.5 Turbo. It maintains SOTA intelligence, while being significantly more cost-effective. GPT-4o mini achieves an 82% score on MMLU and presently ranks higher than GPT-4 on chat preferences common leaderboards. Check out the launch announcement to learn more.
$0.15Input(Per Million)
$0.60Output(Per Million)
128KContext Window
openai

GPT 4o mini (2024 07 18)

(gpt-4o-mini-2024-07-18)

Cheapest
GPT 4o mini (2024 07 18)By OpenAI
GPT-4o mini is OpenAI's newest model after GPT-4 Omni, supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable than other recent frontier models, and more than 60% cheaper than GPT-3.5 Turbo. It maintains SOTA intelligence, while being significantly more cost-effective. GPT-4o mini achieves an 82% score on MMLU and presently ranks higher than GPT-4 on chat preferences common leaderboards. Check out the launch announcement to learn more.
$0.15Input(Per Million)
$0.60Output(Per Million)
128KContext Window
openai

GPT 4o

(gpt-4o)

Mid
GPT 4oBy OpenAI
GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of GPT-4 Turbo while being twice as fast and 50% more cost-effective. GPT-4o also offers improved performance in processing non-English languages and enhanced visual capabilities. For benchmarking against other models, it was briefly called "im-also-a-good-gpt2-chatbot"
$2.50Input(Per Million)
$10.00Output(Per Million)
128KContext Window
openai

GPT 4o (2024 08 06)

(gpt-4o-2024-08-06)

Mid
GPT 4o (2024 08 06)By OpenAI
The 2024-08-06 version of GPT-4o offers improved performance in structured outputs, with the ability to supply a JSON schema in the respone_format. Read more here: https://openai.com/index/introducing-structured-outputs-in-the-api/. GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of GPT-4 Turbo while being twice as fast and 50% more cost-effective. GPT-4o also offers improved performance in processing non-English languages and enhanced visual capabilities. For benchmarking against other models, it was briefly called "im-also-a-good-gpt2-chatbot"
$2.50Input(Per Million)
$10.00Output(Per Million)
128KContext Window
openai

ChatGPT 4o

(chatgpt-4o-latest)

Expensive
ChatGPT 4oBy OpenAI
OpenAI ChatGPT 4o is continually updated by OpenAI to point to the current version of GPT-4o used by ChatGPT. It therefore differs slightly from the API version of GPT-4o in that it has additional RLHF. It is intended for research and evaluation. OpenAI notes that this model is not suited for production use-cases as it may be removed or redirected to another model in the future.
$5.00Input(Per Million)
$15.00Output(Per Million)
128KContext Window
openai

GPT 4o (2024 11 20)

(gpt-4o-2024-11-20)

Mid
GPT 4o (2024 11 20)By OpenAI
The 2024-11-20 version of GPT-4o offers a leveled-up creative writing ability with more natural, engaging, and tailored writing to improve relevance & readability. It's also better at working with uploaded files, providing deeper insights & more thorough responses. GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of GPT-4 Turbo while being twice as fast and 50% more cost-effective. GPT-4o also offers improved performance in processing non-English languages and enhanced visual capabilities.
$2.50Input(Per Million)
$10.00Output(Per Million)
128KContext Window
openai

GPT 4.1

(gpt-4.1)

Mid
GPT 4.1By OpenAI
GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o and GPT-4.5 across coding (54.6% SWE-bench Verified), instruction compliance (87.4% IFEval), and multimodal understanding benchmarks. It is tuned for precise code diffs, agent reliability, and high recall in large document contexts, making it ideal for agents, IDE tooling, and enterprise knowledge retrieval.
$2.00Input(Per Million)
$8.00Output(Per Million)
1MContext Window
openai

GPT 4.1 Mini

(gpt-4.1-mini)

Cheapest
GPT 4.1 MiniBy OpenAI
GPT-4.1 Mini is a mid-sized model delivering performance competitive with GPT-4o at substantially lower latency and cost. It retains a 1 million token context window and scores 45.1% on hard instruction evals, 35.8% on MultiChallenge, and 84.1% on IFEval. Mini also shows strong coding ability (e.g., 31.6% on Aider's polyglot diff benchmark) and vision understanding, making it suitable for interactive applications with tight performance constraints.
$0.40Input(Per Million)
$1.60Output(Per Million)
1MContext Window
openai

GPT 4.1 Nano

(gpt-4.1-nano)

Cheapest
GPT 4.1 NanoBy OpenAI
For tasks that demand low latency, GPT‑4.1 nano is the fastest and cheapest model in the GPT-4.1 series. It delivers exceptional performance at a small size with its 1 million token context window, and scores 80.1% on MMLU, 50.3% on GPQA, and 9.8% on Aider polyglot coding – even higher than GPT‑4o mini. It's ideal for tasks like classification or autocompletion.
$0.10Input(Per Million)
$0.40Output(Per Million)
1MContext Window
openai

GPT 3.5 Turbo

(gpt-3.5-turbo)

Cheapest
GPT 3.5 TurboBy OpenAI
GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data up to Sep 2021.
$0.50Input(Per Million)
$1.50Output(Per Million)
16.4KContext Window
openai

o1

(o1)

Expensive
o1By OpenAI
The latest and strongest model family from OpenAI, o1 is designed to spend more time thinking before responding. The o1 model series is trained with large-scale reinforcement learning to reason using chain of thought. The o1 models are optimized for math, science, programming, and other STEM-related tasks. They consistently exhibit PhD-level accuracy on benchmarks in physics, chemistry, and biology.
$15.00Input(Per Million)
$60.00Output(Per Million)
200KContext Window
openai

o3

(o3)

Expensive
o3By OpenAI
o3 is a well-rounded and powerful model across domains. It sets a new standard for math, science, coding, and visual reasoning tasks. It also excels at technical writing and instruction-following. Use it to think through multi-step problems that involve analysis across text, code, and images. Note that BYOK is required for this model.
$10.00Input(Per Million)
$40.00Output(Per Million)
200KContext Window
openai

o3 Mini

(o3-mini)

Cheapest
o3 MiniBy OpenAI
OpenAI o3-mini is a cost-efficient language model optimized for STEM reasoning tasks, particularly excelling in science, mathematics, and coding. This model supports the `reasoning_effort` parameter, which can be set to "high", "medium", or "low" to control the thinking time of the model. The default is "medium". OpenRouter also offers the model slug `openai/o3-mini-high` to default the parameter to "high". The model features three adjustable reasoning effort levels and supports key developer capabilities including function calling, structured outputs, and streaming, though it does not include vision processing capabilities. The model demonstrates significant improvements over its predecessor, with expert testers preferring its responses 56% of the time and noting a 39% reduction in major errors on complex questions. With medium reasoning effort settings, o3-mini matches the performance of the larger o1 model on challenging reasoning evaluations like AIME and GPQA, while maintaining lower latency and cost.
$1.10Input(Per Million)
$4.40Output(Per Million)
200KContext Window
openai

o4 Mini

(o4-mini)

Cheapest
o4 MiniBy OpenAI
OpenAI o4-mini is a compact reasoning model in the o-series, optimized for fast, cost-efficient performance while retaining strong multimodal and agentic capabilities. It supports tool use and demonstrates competitive reasoning and coding performance across benchmarks like AIME (99.5% with Python) and SWE-bench, outperforming its predecessor o3-mini and even approaching o3 in some domains. Despite its smaller size, o4-mini exhibits high accuracy in STEM tasks, visual problem solving (e.g., MathVista, MMMU), and code editing. It is especially well-suited for high-throughput scenarios where latency or cost is critical. Thanks to its efficient architecture and refined reinforcement learning training, o4-mini can chain tools, generate structured outputs, and solve multi-step tasks with minimal delay—often in under a minute.
$1.10Input(Per Million)
$4.40Output(Per Million)
200KContext Window
openai

Codex Mini

(codex-mini-latest)

Mid
Codex MiniBy OpenAI
codex-mini-latest is a fine-tuned version of o4-mini specifically for use in Codex CLI. For direct use in the API, we recommend starting with gpt-4.1.
$1.50Input(Per Million)
$6.00Output(Per Million)
200KContext Window
openai

GPT 4 Turbo

(gpt-4-turbo)

Expensive
GPT 4 TurboBy OpenAI
The latest GPT-4 Turbo model with vision capabilities. Vision requests can now use JSON mode and function calling. Training data: up to December 2023.
$10.00Input(Per Million)
$30.00Output(Per Million)
128KContext Window
openai

GPT 4o Search Preview

(gpt-4o-search-preview)

Mid
GPT 4o Search PreviewBy OpenAI
GPT-4o Search Preview is a specialized model for web search in Chat Completions. It is trained to understand and execute web search queries.
$2.50Input(Per Million)
$10.00Output(Per Million)
128KContext Window
openai

GPT 4.5 (Preview)

(gpt-4.5-preview)

Expensive
GPT 4.5 (Preview)By OpenAI
GPT-4.5 (Preview) is a research preview of OpenAI's latest language model, designed to advance capabilities in reasoning, creativity, and multi-turn conversation. It builds on previous iterations with improvements in world knowledge, contextual coherence, and the ability to follow user intent more effectively. The model demonstrates enhanced performance in tasks that require open-ended thinking, problem-solving, and communication. Early testing suggests it is better at generating nuanced responses, maintaining long-context coherence, and reducing hallucinations compared to earlier versions. This research preview is intended to help evaluate GPT-4.5's strengths and limitations in real-world use cases as OpenAI continues to refine and develop future models.
$75.00Input(Per Million)
$150.00Output(Per Million)
128KContext Window
openai

GPT 3.5 Turbo 16k

(gpt-3.5-turbo-0125)

Cheapest
GPT 3.5 Turbo 16kBy OpenAI
The latest GPT-3.5 Turbo model with improved instruction following, JSON mode, reproducible outputs, parallel function calling, and more. Training data: up to Sep 2021. This version has a higher accuracy at responding in requested formats and a fix for a bug which caused a text encoding issue for non-English language function calls.
$0.50Input(Per Million)
$1.50Output(Per Million)
16.4KContext Window
openai

o1 mini

(o1-mini)

Cheapest
o1 miniBy OpenAI
The latest and strongest model family from OpenAI, o1 is designed to spend more time thinking before responding. The o1 models are optimized for math, science, programming, and other STEM-related tasks. They consistently exhibit PhD-level accuracy on benchmarks in physics, chemistry, and biology. Learn more in the launch announcement. Note: This model is currently experimental and not suitable for production use-cases, and may be heavily rate-limited.
$1.10Input(Per Million)
$4.40Output(Per Million)
128KContext Window
...
Showing 1-20 of 265 models